Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 380 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 38.7 KiB |
| Average record size in memory | 104.3 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 3 |
HS is highly correlated with HST | High correlation |
AS is highly correlated with AST | High correlation |
HST is highly correlated with HS | High correlation |
AST is highly correlated with AS | High correlation |
HS is highly correlated with HST | High correlation |
AS is highly correlated with AST | High correlation |
HST is highly correlated with HS | High correlation |
AST is highly correlated with AS | High correlation |
HS is highly correlated with HST | High correlation |
AS is highly correlated with AST | High correlation |
HST is highly correlated with HS | High correlation |
AST is highly correlated with AS | High correlation |
HS is highly correlated with HST | High correlation |
AS is highly correlated with AST and 1 other fields | High correlation |
HST is highly correlated with HS | High correlation |
AST is highly correlated with AS | High correlation |
HF is highly correlated with HY | High correlation |
AC is highly correlated with AS | High correlation |
HY is highly correlated with HF | High correlation |
AST has 4 (1.1%) zeros | Zeros |
HC has 6 (1.6%) zeros | Zeros |
AC has 6 (1.6%) zeros | Zeros |
HY has 94 (24.7%) zeros | Zeros |
AY has 64 (16.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-05 05:13:36.430439 |
|---|---|
| Analysis finished | 2022-04-05 05:13:57.596067 |
| Duration | 21.17 seconds |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
| Distinct | 28 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.80263158 |
| Minimum | 1 |
|---|---|
| Maximum | 32 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 11 |
| median | 13.5 |
| Q3 | 17 |
| 95-th percentile | 23 |
| Maximum | 32 |
| Range | 31 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.77244658 |
|---|---|
| Coefficient of variation (CV) | 0.3457635272 |
| Kurtosis | 0.3982875386 |
| Mean | 13.80263158 |
| Median Absolute Deviation (MAD) | 3.5 |
| Skewness | 0.347270232 |
| Sum | 5245 |
| Variance | 22.77624635 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=28)
| Value | Count | Frequency (%) |
| 13 | 39 | 10.3% |
| 14 | 36 | 9.5% |
| 11 | 34 | 8.9% |
| 17 | 29 | 7.6% |
| 12 | 27 | 7.1% |
| 15 | 25 | 6.6% |
| 16 | 25 | 6.6% |
| 10 | 24 | 6.3% |
| 9 | 20 | 5.3% |
| 19 | 18 | 4.7% |
| Other values (18) | 103 |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.3% |
| 2 | 1 | 0.3% |
| 3 | 2 | 0.5% |
| 4 | 3 | 0.8% |
| 5 | 3 | 0.8% |
| 6 | 12 | |
| 7 | 11 | |
| 8 | 13 | |
| 9 | 20 | |
| 10 | 24 |
| Value | Count | Frequency (%) |
| 32 | 1 | 0.3% |
| 27 | 2 | 0.5% |
| 26 | 3 | 0.8% |
| 25 | 1 | 0.3% |
| 24 | 6 | 1.6% |
| 23 | 8 | |
| 22 | 3 | 0.8% |
| 21 | 6 | 1.6% |
| 20 | 11 | |
| 19 | 18 |
| Distinct | 23 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.06315789 |
| Minimum | 2 |
|---|---|
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 8 |
| median | 11 |
| Q3 | 13 |
| 95-th percentile | 18 |
| Maximum | 25 |
| Range | 23 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.215339562 |
|---|---|
| Coefficient of variation (CV) | 0.3810249842 |
| Kurtosis | 0.3009830451 |
| Mean | 11.06315789 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.3844329432 |
| Sum | 4204 |
| Variance | 17.76908763 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=23)
| Value | Count | Frequency (%) |
| 10 | 40 | |
| 11 | 40 | |
| 13 | 38 | |
| 12 | 36 | |
| 9 | 32 | 8.4% |
| 7 | 31 | 8.2% |
| 14 | 21 | 5.5% |
| 8 | 18 | 4.7% |
| 16 | 18 | 4.7% |
| 6 | 17 | 4.5% |
| Other values (13) | 89 |
| Value | Count | Frequency (%) |
| 2 | 3 | 0.8% |
| 3 | 7 | 1.8% |
| 4 | 11 | 2.9% |
| 5 | 14 | 3.7% |
| 6 | 17 | |
| 7 | 31 | |
| 8 | 18 | |
| 9 | 32 | |
| 10 | 40 | |
| 11 | 40 |
| Value | Count | Frequency (%) |
| 25 | 1 | 0.3% |
| 24 | 3 | 0.8% |
| 23 | 2 | 0.5% |
| 21 | 2 | 0.5% |
| 20 | 2 | 0.5% |
| 19 | 7 | 1.8% |
| 18 | 7 | 1.8% |
| 17 | 15 | |
| 16 | 18 | |
| 15 | 15 |
| Distinct | 19 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.657894737 |
| Minimum | 0 |
|---|---|
| Maximum | 18 |
| Zeros | 1 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 14 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.428703269 |
|---|---|
| Coefficient of variation (CV) | 0.4477344476 |
| Kurtosis | -0.3033393372 |
| Mean | 7.657894737 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.3379770372 |
| Sum | 2910 |
| Variance | 11.75600611 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=19)
| Value | Count | Frequency (%) |
| 7 | 52 | |
| 8 | 42 | |
| 5 | 38 | |
| 6 | 35 | |
| 9 | 31 | |
| 10 | 30 | |
| 4 | 29 | |
| 11 | 24 | |
| 12 | 23 | |
| 3 | 23 | |
| Other values (9) | 53 |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.3% |
| 1 | 5 | 1.3% |
| 2 | 14 | 3.7% |
| 3 | 23 | |
| 4 | 29 | |
| 5 | 38 | |
| 6 | 35 | |
| 7 | 52 | |
| 8 | 42 | |
| 9 | 31 |
| Value | Count | Frequency (%) |
| 18 | 1 | 0.3% |
| 17 | 1 | 0.3% |
| 16 | 4 | 1.1% |
| 15 | 7 | 1.8% |
| 14 | 9 | 2.4% |
| 13 | 11 | 2.9% |
| 12 | 23 | |
| 11 | 24 | |
| 10 | 30 | |
| 9 | 31 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.955263158 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 4 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 11 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.956250689 |
|---|---|
| Coefficient of variation (CV) | 0.4964097489 |
| Kurtosis | 0.6620360997 |
| Mean | 5.955263158 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.4886485985 |
| Sum | 2263 |
| Variance | 8.739418136 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=16)
| Value | Count | Frequency (%) |
| 7 | 51 | |
| 4 | 48 | |
| 5 | 48 | |
| 6 | 44 | |
| 3 | 32 | |
| 9 | 32 | |
| 2 | 31 | |
| 8 | 30 | |
| 10 | 23 | |
| 1 | 14 | 3.7% |
| Other values (6) | 27 |
| Value | Count | Frequency (%) |
| 0 | 4 | 1.1% |
| 1 | 14 | 3.7% |
| 2 | 31 | |
| 3 | 32 | |
| 4 | 48 | |
| 5 | 48 | |
| 6 | 44 | |
| 7 | 51 | |
| 8 | 30 | |
| 9 | 32 |
| Value | Count | Frequency (%) |
| 20 | 1 | 0.3% |
| 14 | 3 | 0.8% |
| 13 | 3 | 0.8% |
| 12 | 3 | 0.8% |
| 11 | 13 | 3.4% |
| 10 | 23 | |
| 9 | 32 | |
| 8 | 30 | |
| 7 | 51 | |
| 6 | 44 |
| Distinct | 19 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.98684211 |
| Minimum | 3 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 8 |
| median | 11 |
| Q3 | 13 |
| 95-th percentile | 17 |
| Maximum | 21 |
| Range | 18 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.450721392 |
|---|---|
| Coefficient of variation (CV) | 0.3140776357 |
| Kurtosis | -0.5250702246 |
| Mean | 10.98684211 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.2594516024 |
| Sum | 4175 |
| Variance | 11.90747813 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=19)
| Value | Count | Frequency (%) |
| 11 | 39 | |
| 12 | 38 | |
| 10 | 38 | |
| 13 | 37 | |
| 9 | 36 | |
| 15 | 35 | |
| 8 | 34 | |
| 7 | 33 | |
| 6 | 24 | |
| 14 | 16 | 4.2% |
| Other values (9) | 50 |
| Value | Count | Frequency (%) |
| 3 | 1 | 0.3% |
| 4 | 2 | 0.5% |
| 5 | 9 | 2.4% |
| 6 | 24 | |
| 7 | 33 | |
| 8 | 34 | |
| 9 | 36 | |
| 10 | 38 | |
| 11 | 39 | |
| 12 | 38 |
| Value | Count | Frequency (%) |
| 21 | 1 | 0.3% |
| 20 | 1 | 0.3% |
| 19 | 6 | 1.6% |
| 18 | 4 | 1.1% |
| 17 | 10 | 2.6% |
| 16 | 16 | |
| 15 | 35 | |
| 14 | 16 | |
| 13 | 37 | |
| 12 | 38 |
AF
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.42894737 |
| Minimum | 2 |
|---|---|
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 11 |
| Q3 | 14 |
| 95-th percentile | 18 |
| Maximum | 24 |
| Range | 22 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.481228053 |
|---|---|
| Coefficient of variation (CV) | 0.304597435 |
| Kurtosis | 0.2190760594 |
| Mean | 11.42894737 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.219575488 |
| Sum | 4343 |
| Variance | 12.11894876 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=21)
| Value | Count | Frequency (%) |
| 10 | 50 | |
| 11 | 45 | |
| 13 | 40 | |
| 12 | 38 | |
| 9 | 36 | |
| 14 | 35 | |
| 8 | 30 | |
| 15 | 17 | 4.5% |
| 7 | 16 | 4.2% |
| 16 | 15 | 3.9% |
| Other values (11) | 58 |
| Value | Count | Frequency (%) |
| 2 | 1 | 0.3% |
| 3 | 3 | 0.8% |
| 4 | 4 | 1.1% |
| 5 | 9 | 2.4% |
| 6 | 9 | 2.4% |
| 7 | 16 | 4.2% |
| 8 | 30 | |
| 9 | 36 | |
| 10 | 50 | |
| 11 | 45 |
| Value | Count | Frequency (%) |
| 24 | 1 | 0.3% |
| 21 | 1 | 0.3% |
| 20 | 2 | 0.5% |
| 19 | 7 | 1.8% |
| 18 | 10 | 2.6% |
| 17 | 11 | 2.9% |
| 16 | 15 | 3.9% |
| 15 | 17 | |
| 14 | 35 | |
| 13 | 40 |
| Distinct | 18 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.044736842 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 6 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 12 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.107671788 |
|---|---|
| Coefficient of variation (CV) | 0.5141120067 |
| Kurtosis | 0.5393424423 |
| Mean | 6.044736842 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.7041762858 |
| Sum | 2297 |
| Variance | 9.657623941 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=18)
| Value | Count | Frequency (%) |
| 4 | 56 | |
| 5 | 56 | |
| 3 | 45 | |
| 8 | 40 | |
| 6 | 38 | |
| 7 | 36 | |
| 10 | 22 | 5.8% |
| 9 | 22 | 5.8% |
| 2 | 19 | 5.0% |
| 11 | 11 | 2.9% |
| Other values (8) | 35 |
| Value | Count | Frequency (%) |
| 0 | 6 | 1.6% |
| 1 | 9 | 2.4% |
| 2 | 19 | 5.0% |
| 3 | 45 | |
| 4 | 56 | |
| 5 | 56 | |
| 6 | 38 | |
| 7 | 36 | |
| 8 | 40 | |
| 9 | 22 | 5.8% |
| Value | Count | Frequency (%) |
| 17 | 1 | 0.3% |
| 16 | 3 | 0.8% |
| 15 | 2 | 0.5% |
| 14 | 3 | 0.8% |
| 13 | 3 | 0.8% |
| 12 | 8 | 2.1% |
| 11 | 11 | 2.9% |
| 10 | 22 | |
| 9 | 22 | |
| 8 | 40 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.997368421 |
| Minimum | 0 |
|---|---|
| Maximum | 16 |
| Zeros | 6 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.834482995 |
|---|---|
| Coefficient of variation (CV) | 0.5671951227 |
| Kurtosis | 1.041808135 |
| Mean | 4.997368421 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.9055941766 |
| Sum | 1899 |
| Variance | 8.034293848 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=16)
| Value | Count | Frequency (%) |
| 4 | 63 | |
| 3 | 56 | |
| 5 | 50 | |
| 6 | 44 | |
| 2 | 41 | |
| 7 | 39 | |
| 1 | 22 | 5.8% |
| 8 | 19 | 5.0% |
| 11 | 11 | 2.9% |
| 9 | 10 | 2.6% |
| Other values (6) | 25 | 6.6% |
| Value | Count | Frequency (%) |
| 0 | 6 | 1.6% |
| 1 | 22 | 5.8% |
| 2 | 41 | |
| 3 | 56 | |
| 4 | 63 | |
| 5 | 50 | |
| 6 | 44 | |
| 7 | 39 | |
| 8 | 19 | 5.0% |
| 9 | 10 | 2.6% |
| Value | Count | Frequency (%) |
| 16 | 2 | 0.5% |
| 14 | 1 | 0.3% |
| 13 | 3 | 0.8% |
| 12 | 5 | 1.3% |
| 11 | 11 | 2.9% |
| 10 | 8 | 2.1% |
| 9 | 10 | 2.6% |
| 8 | 19 | |
| 7 | 39 | |
| 6 | 44 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.413157895 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 94 |
| Zeros (%) | 24.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.16017264 |
|---|---|
| Coefficient of variation (CV) | 0.8209787772 |
| Kurtosis | 0.9185688125 |
| Mean | 1.413157895 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7573984279 |
| Sum | 537 |
| Variance | 1.346000555 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 2 | 102 | |
| 0 | 94 | |
| 3 | 49 | |
| 4 | 11 | 2.9% |
| 5 | 3 | 0.8% |
| 7 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 94 | |
| 1 | 120 | |
| 2 | 102 | |
| 3 | 49 | |
| 4 | 11 | 2.9% |
| 5 | 3 | 0.8% |
| 7 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 7 | 1 | 0.3% |
| 5 | 3 | 0.8% |
| 4 | 11 | 2.9% |
| 3 | 49 | |
| 2 | 102 | |
| 1 | 120 | |
| 0 | 94 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.839473684 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 64 |
| Zeros (%) | 16.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.30475879 |
|---|---|
| Coefficient of variation (CV) | 0.7093109303 |
| Kurtosis | 0.2904893543 |
| Mean | 1.839473684 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.543688834 |
| Sum | 699 |
| Variance | 1.702395501 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 2 | 113 | |
| 1 | 94 | |
| 3 | 72 | |
| 0 | 64 | |
| 4 | 26 | 6.8% |
| 5 | 8 | 2.1% |
| 6 | 2 | 0.5% |
| 7 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 64 | |
| 1 | 94 | |
| 2 | 113 | |
| 3 | 72 | |
| 4 | 26 | 6.8% |
| 5 | 8 | 2.1% |
| 6 | 2 | 0.5% |
| 7 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 7 | 1 | 0.3% |
| 6 | 2 | 0.5% |
| 5 | 8 | 2.1% |
| 4 | 26 | 6.8% |
| 3 | 72 | |
| 2 | 113 | |
| 1 | 94 | |
| 0 | 64 |
HR
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 KiB |
| 0 | |
|---|---|
| 1 | 29 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 380 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 351 | |
| 1 | 29 | 7.6% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 0 | 351 | |
| 1 | 29 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 351 | |
| 1 | 29 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 380 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 351 | |
| 1 | 29 | 7.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 380 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 351 | |
| 1 | 29 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 351 | |
| 1 | 29 | 7.6% |
AR
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 KiB |
| 0 | |
|---|---|
| 1 | 30 |
| 2 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 380 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 348 | |
| 1 | 30 | 7.9% |
| 2 | 2 | 0.5% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 0 | 348 | |
| 1 | 30 | 7.9% |
| 2 | 2 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 348 | |
| 1 | 30 | 7.9% |
| 2 | 2 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 380 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 348 | |
| 1 | 30 | 7.9% |
| 2 | 2 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 380 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 348 | |
| 1 | 30 | 7.9% |
| 2 | 2 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 348 | |
| 1 | 30 | 7.9% |
| 2 | 2 | 0.5% |
FTR
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 KiB |
| H | |
|---|---|
| D | |
| A |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 380 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | H |
|---|---|
| 2nd row | H |
| 3rd row | D |
| 4th row | H |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
| H | 179 | |
| D | 111 | |
| A | 90 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| h | 179 | |
| d | 111 | |
| a | 90 |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 179 | |
| D | 111 | |
| A | 90 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 380 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 179 | |
| D | 111 | |
| A | 90 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 380 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 179 | |
| D | 111 | |
| A | 90 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 179 | |
| D | 111 | |
| A | 90 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| HS | AS | HST | AST | HF | AF | HC | AC | HY | AY | HR | AR | FTR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 23 | 12 | 11 | 2 | 15 | 15 | 16 | 7 | 1 | 2 | 0 | 0 | H |
| 1 | 7 | 17 | 2 | 12 | 19 | 14 | 1 | 3 | 2 | 1 | 0 | 0 | H |
| 2 | 13 | 12 | 9 | 7 | 12 | 13 | 4 | 8 | 1 | 3 | 0 | 0 | D |
| 3 | 18 | 10 | 13 | 4 | 10 | 10 | 3 | 1 | 1 | 0 | 0 | 0 | H |
| 4 | 6 | 13 | 2 | 7 | 13 | 10 | 3 | 6 | 3 | 3 | 1 | 0 | D |
| 5 | 22 | 11 | 18 | 7 | 13 | 16 | 10 | 3 | 0 | 2 | 0 | 0 | D |
| 6 | 11 | 9 | 6 | 7 | 8 | 11 | 6 | 4 | 1 | 1 | 0 | 0 | A |
| 7 | 13 | 10 | 7 | 6 | 17 | 13 | 5 | 5 | 0 | 2 | 0 | 0 | H |
| 8 | 7 | 14 | 4 | 7 | 13 | 15 | 9 | 11 | 1 | 3 | 1 | 1 | D |
| 9 | 18 | 7 | 10 | 3 | 9 | 5 | 5 | 3 | 2 | 2 | 0 | 0 | H |
Last rows
| HS | AS | HST | AST | HF | AF | HC | AC | HY | AY | HR | AR | FTR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 370 | 10 | 9 | 8 | 4 | 12 | 12 | 4 | 5 | 2 | 2 | 0 | 0 | H |
| 371 | 16 | 16 | 8 | 10 | 11 | 10 | 6 | 5 | 1 | 1 | 1 | 0 | A |
| 372 | 11 | 19 | 3 | 9 | 12 | 14 | 6 | 5 | 3 | 2 | 1 | 0 | H |
| 373 | 10 | 12 | 6 | 6 | 12 | 5 | 5 | 3 | 2 | 1 | 1 | 0 | D |
| 374 | 18 | 14 | 11 | 7 | 8 | 5 | 8 | 8 | 2 | 1 | 0 | 0 | H |
| 375 | 15 | 13 | 10 | 7 | 5 | 8 | 7 | 6 | 0 | 0 | 0 | 0 | D |
| 376 | 11 | 11 | 5 | 9 | 10 | 9 | 5 | 3 | 1 | 4 | 0 | 0 | A |
| 377 | 22 | 7 | 16 | 3 | 5 | 15 | 7 | 5 | 0 | 3 | 0 | 0 | H |
| 378 | 17 | 17 | 12 | 12 | 7 | 8 | 4 | 6 | 0 | 2 | 0 | 0 | A |
| 379 | 12 | 13 | 5 | 10 | 12 | 9 | 8 | 3 | 3 | 1 | 0 | 0 | A |